User Modeling of Parallel Workloads

نویسندگان

  • David Talby
  • Dror G. Feitelson
  • Adi Raveh
چکیده

The goal of workload modeling is to simulate the expected workload, accurately enough to enable making correct design and administrative decisions. Several statistical features of production parallel computer workloads, which are not embodied in current models, have been identified. Their practical importance is demonstrated by two new kinds of schedulers – a key component in determining the overall performance of a parallel computer. The first is adaptive scheduling, which takes advantages of the locality of sampling and known cycles in parallel workloads, and achieves an average improvement of 10% in performance and 35% in stability for the tested production workloads. The second is shortest-jobbackfill-first scheduling, which relies on runtime prediction, done by analyzing user and session histories. These schedulers cannot be correctly evaluated by existing workload models, and we argue that the correct approach for future workload models (as well as on-line algorithms) is user and session-based modeling, instead of modeling jobs directly as done today. As the basis for such a model, we use PCA to provide variable sets which explain over 80% of the variance between users and sessions, and clustering to identify five stable session clusters and four stable user clusters. We then model the distributions of the arrival and activity patterns of both users and sessions, including a complete analysis of their dependencies and temporal structure. The model is based on logs from seven different parallel supercomputers, spanning over 87 months, analyzed together to ensure that results are location and architecture-neutral.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Workload modeling for parallel computers

The availability of good workload models is essential for the design and analysis of parallel computer systems. A workload model can be applied directly in an experimental or simulation environment to verify new scheduling policies or strategies. Moreover, it can be used for extrapolating and predicting future workload conditions. In this work, we focus on the workload modeling for parallel com...

متن کامل

Analysis and Modeling of Social Influence in High Performance Computing Workloads

Analysis and Modeling of Social Influence in High Performance Computing Workloads Shuai Zheng High Performance Computing (HPC) is becoming a common tool in many research areas. Social influence (e.g., project collaboration) among increasing users of HPC systems creates bursty behavior in underlying workloads. This bursty behavior is increasingly common with the advent of grid computing and clou...

متن کامل

Nursing Workloads and Psychological Empowerment in Hospitals: Structural Equations Modeling

Background: The high workload of nurses in hospitals has been identified as a patient safety and worker stress problem. Psychological empowerment is a motivational concept demonstrated in four dimensions: meaning, competence, self-determination, and impact. Objective: This study investigated the relationship between nurses’ workloads and psychological emp...

متن کامل

Modeling Parallel System Workloads with Temporal Locality

In parallel systems, similar jobs tend to arrive within bursty periods. This fact leads to the existence of the locality phenomenon, a persistent similarity between nearby jobs, in real parallel computer workloads. This important phenomenon deserves to be taken into account and used as a characteristic of any workload model. Regrettably, this property has received little if any attention of res...

متن کامل

Utilization, Predictability, Workloads, and User Runtime Estimates in Scheduling the IBM SP2 with Backfilling

ÐScheduling jobs on the IBM SP2 system and many other distributed-memory MPPs is usually done by giving each job a partition of the machine for its exclusive use. Allocating such partitions in the order in which the jobs arrive (FCFS scheduling) is fair and predictable, but suffers from severe fragmentation, leading to low utilization. This situation led to the development of the EASY scheduler...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006